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Abstract — Utility-based power allocation in wireless ad-hoc networks is inherently nonconvex because of the global coupling induced 
by the co-channel interference. To tackle this challenge, we first show that the globally optimal point lies on the boundary of the 
feasible region, which is utilized as a basis to transform the utility maximization problem into an equivalent max-min problem with more 
structure. By using extended duality theory penalty multipliers are introduced for penalizing the constraint violations, and the minimum 
weighted utility maximization problem is then decomposed into subproblems for individual users to devise a distributed stochastic 
power control algorithm, where each user stochastically adjusts its target utility to improve the total utility by simulated annealing. 
The proposed distributed power control algorithm can guarantee global optimality at the cost of slow convergence due to simulated 
annealing involved in the global optimization. The geometric cooling scheme and suitable penalty parameters are used to improve 
the convergence rate. Next, by integrating the stochastic power control approach with the back-pressure algorithm, we develop a joint 
scheduling and power allocation policy to stabilize the queueing systems. Finally we generalize the above distributed power control 
algorithms to multicast communications, and show their global optimality for multicast traffic. 

Index Terms — Distributed Power Control, Nonconvex Optimization, Extended Duality Theory, Simulated Annealing, Queue Stability, 
Unicast Communications, Multicast Communications. 



1 Introduction 

THE broadcast nature of wireless transmissions makes 
wireless networks susceptible to interference, which 
deteriorates quality of service (QoS) provisioning. Power 
control is considered as a promising technique to miti- 
gate interference. One primary objective of power control 
is to maximize the system utility that can achieve a 
variety of fairness objectives among users |2l, ||3l, IH, 
||5l . However, maximizing the system utility, under the 
physical interference model, often involves nonconvex 
optimization and it is known to be NP-hard, due to the 
complicated coupling among users through interference 

Due to the nonconvex nature of the power control 
problem, it is challenging to find the globally optimal 
power allocation in a distributed manner. Notably, 0, 
||8l devised distributed power control algorithms to find 
power allocations that satisfy the local optimality con- 
ditions, but global optimality could not be guaranteed 
in general, except for some special convexifiable cases 
(e.g., with strictly increasing log-concave utility fimc- 
tions). Another thread of work applied game-theoretic 
approaches to power control by treating it as a non- 
cooperative game among transmitters HI, IfTOl . However, 
distributed solutions that converge to a Nash equilib- 
rium may be suboptimal in terms of maximizing the 
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total system utility. Different from these approaches, 
[11] proposed a globally optimal power control scheme, 
named MAPEL, by exploiting the monotonic nature of 
the optimization problem. However, the complexity and 
the centralized nature of MAPEL hinder its applicability 
in practical scenarios, and thus it can be treated rather as 
a benchmark for performance evaluation in distributed 
networks. 

To find the globally optimal power allocation in a 
distributed setting, an interesting work ||T2| has pro- 
posed the SEER algorithm based on Gibbs sampling ITSlI , 
which can approach the globally optimal solution in an 
asymptotic sense when the control parameter in Gibbs 
sampling tends to infinity. Notably, for each iteration in 
the SEER algorithm, each user utilizes Gibbs sampling 
to compute its transition probability distribution for 
updating its transmission power, where the requirement 
for message passing and computing the transition prob- 
ability distribution in each iteration can be demanding 
when applied to ad-hoc communications. 

A challenging task in distributed power control in ad- 
hoc networks is to reduce the amount of message passing 
while preserving the global optimality. In this paper, 
we tackle this challenge by combining recent advances 
in extended duality theory (EDT) | [T4) with simulated 
annealing (SA) [15J. Compared with the classical du- 
ality theory with nonzero duality gap for nonconvex 
optimization problems, EDT can guarantee zero duality 
gap between the primal and dual problems by utilizing 
nonlinear Lagrangian functions. This property allows for 
solving the nonconvex problem by its extended dual while 
preserving the global optimality with distributed imple- 
mentation. Furthermore, as will be shown in Section II, 
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for the subproblem of each individual user, the extended 
dual can then be solved through stochastic search us- 
ing SA. In particular, we first transform the original 
utility maximization problem into an equivalent max- 
min problem. This step is based on the key observation 
that in the case with continuous and strictly increasing 
utility functions, the globally optimal solution is always 
on the boundary of the feasible (utility) region. Then, 
appealing to EDT and SA, we develop a distributed 
stochastic power control (DSPC) algorithm that stochas- 
tically searches for the optimal power allocation in the 
neighborhood of the feasible region's boundary, instead 
of bouncing around in the entire feasible region. 

Specifically, we first show that DSPC can achieve the 
global optimality in the underlying nonconvex problem, 
although the convergence rate can be slow (but this is 
clearly due to the slow convergence nature of SA). Then, 
to improve the convergence rate of DSPC, we propose 
an enhanced DSPC (EDSPC) algorithm that employs 
the geometric cooling schedule and performs a careful 
selection of penalty parameters. As a benchmark for 
performance evaluation, we also develop a centralized 
algorithm to search for the globally optimal solution over 
simplices that cover the utility region. The performance 
gain is further verified by comparing our distributed 
algorithms with MAPEL ini, SEER fig, and ADP [7] 
algorithms. Worth noting is that the proposed DSPC 
and EDSPC algorithms do not require any knowledge 
of channel gains, which is typically needed in existing 
algorithms, and instead they need only the Signal-to- 
Interference-plus-Noise (SINK) feedback for adaptation. 

Next, we integrate the above distributed power con- 
trol with the back-pressure algorithm f22\ and devise 
a joint scheduling and power allocation policy for im- 
proving the stability in the presence of dynamic packet 
arrivals and departures. This policy fits into the dynamic 
back-pressure and resource allocation framework and 
enables distributed utility maximization without extra 
technical conditions Il23l |25| . Then, we generalize the 
study to consider multicast communications, where a 
single transmission may simultaneously deliver packets 
to multiple recipients [161. Specifically, we extend DSPC 
and EDSPC algorithms to multicast communications 
with distributed implementation, and show that these al- 
gorithms can also achieve the global optimality in terms 
of jointly maximizing the minimum rates on bottleneck 
links in different multicast groups. 

The rest of the paper is organized as follows. In Section 
m we first introduce the system model, establish the 
equivalence between the utility maximization problem 
and its max-min form, and then develop both centralized 
and distributed algorithms for the max-min problem. 
Next, in Section |3l building on these power control 
algorithms, we develop a joint scheduling and power 
allocation policy to stabilize queueing systems. The gen- 
eralization to multicast communications is presented in 
Section H) We conclude the paper in Section |5l 



Tx, 



Hi 



in 



Rxi 





^ '< / 






TX2 






RX2 






• 
• 


/ X ^ 


• 
• 


• 


^^^^ ' 




• 


Txl 


K 




Rxl 








Fig. 1: System 


model. 





2 Power Control For Unicast Commu- 
nications 

2.1 System Model 

We consider an ad-hoc wireless network with a set 
£ = {!,..., L} of links, where the channel is interference- 
limited, and all L links treat interference as noise, as 
illustrated in Fig. [TJ Such a model of communication is 
also applicable to cellular networks 13. Each link consists 
of a dedicated transmitter-receiver pair.^ We denote by 
/i;fe the fixed channel gain between user Z's transmitter 
and user fc's receiver, and by the transmission power 
of link I with P"^^^ being its maximum power constraint. 
For static channels, the received SINR for the /th user 
with a matched filter receiver is given by 



7z(p) 



(1) 



where p = (pi,...,pl) is a vector of the users' trans- 
mission powers and ?i; is the noise power. Accordingly, 
the /th user receives the utility Ui{ji), where Ui{-) is 
continuous and strictly increasing. We assume that each 
user I's utility is zero when ji = 0, i.e., Ui{0) = 0. For 
ease of reference, some notation is listed in Table [T]^ 

2.2 Network Utility Maximization 

We seek to find the optimal power allocation p* that 
maximizes the overall system utility subject to the indi- 
vidual power constraints, given by the following opti- 
mization problem: 



maximize I];e£ ^'(^/(p)) 
subject to 0<pi < P,"'^'', Vie C 
variables {p}. 



(2) 



1. We use the terms "user" and "link" interchangeably throughout. 

2. We use bold symbols (e.g., p) to denote vectors and calligraphic 
symbols (e.g., £) to denote sets. 
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Notation 


Definition 


C, 


set ot links 


L 


total number of links 


hik 


channel gain from link I's transmitter 
to link k's receiver 


H 


link gain matrix 


Pi (in vector p) 


transmission power of link / 


ni (in vector n) 


noise power for link I 




btJNli< of Imk L 


7!(-} 


DiiNK runction or unK i 


ud-) 


utility function of link I 


xi (in vector x) 


ratio of link I's utility to the total network utility 


r; (in vector r) 


transmission rate of link / 


n(-) 


transmission rate function of link I 


a, 13 


penalty multipliers 



TABLE 1: Summary of the notations and definitions. 



Approach 


Case 1 


Case 11 


Power 


Sum Kate 


Power 


Sum Rate 


GP 


[20, 7.68] 


3.02 


[1, 0.61] 


0.98 


ADP 


[20, 6.46J 


3.10 


[1, 2J 


1.16 


MAPEL 


[20, 6.79] 


3.10 


[0, 2] 


1.22 


SEER 


[20, 6.90] 


3.10 


[0, 2] 


1.22 



TABLE 2: The performance of the existing approaches 
for Case I and IL 



In general, ||2jl is a nonconvex problem^. In particular, 
if the utility function is the Shannon rate achievable 
over Gaussian flat fading channels, namely C/i(7;(p)) = 

log(l + 7;(p)), where wi > is a weight associated 
with user /, (|2) boils down to the weighted sum rate 
maximization problem, which is known to be nonconvex 
and NP-hard [6]. Note that the weights can serve as 
the fairness measures |17| for different scenarios. In 
particular, in queueing systems, for arrival rates within 
the stability region, packet queues can be stabilized by 
solving this weighted sum rate maximization problem, 
when the instantaneous queue lengths are chosen as the 
weights. In Section |3l we will discuss how to stabilize 
the packet queues by integrating our distributed power 
control algorithms with the back-pressure algorithm. 

Let F denote the feasible utility region, where for each 
point U = (J/i, [/l) in J", there exists a power vector 
p such that Ui = Ui{"fi{p)) for all I £ C. The feasible 
utility region J' is nonconvex, and in general, finding 
the globally optimal solution to (|2j in is challenging. 
In the following example, we illustrate the geometry of 
T for the utility Ui{ji{p)) ~ wi log(l+7i(p)) and evaluate 
the solutions to l|2ll given by some existing power control 
approaches discussed in Section [T] 

Example: For the case with two links. Fig. |2] illustrates 
the nonconvex feasible utility region !F for different 
system parameters. We compare the performance of the 
existing approaches E|, IZl, HD, |l2 in Table E] 

Remarks: The solutions to 101 given by 13, Q, ini 
are either distributed but suboptimal or optimal but cen- 
tralized. In particular, [2J solves ^ by using geometric 

3. For some special utility functions Ui{.), (2) can be transformed 
into a convex problem fH. In this paper, we focus on the nonconvex 
case that cannot be transformed to a convex problem by change of 
variables. 



— Feasible Utility Region (I) 
■ Optimal Solution (I) 

— Feasible Utility Region i 
4 Optimal Solution (II) 




Fig. 2: The feasible utility region J^. Case (I): the channel 
gains are given by hn = 0.73, hi2 = 0.04, /121 = 0.03, 
and /122 = 0.89, and the maximum power are p™^^ = 
20, p™^^ = 100; Case (II): the channel gains are given 
by hii = 0.30, hi2 = 0.50, /i2i = 0.03, and /i22 = 0.80, 



= and the maximum power are P™ 



I, -P2" 



= 2. In 



both cases, the noise power is 0.1 for each link, and the 
weights are wi ~ 0.57, W2 — 0.43. 



programming (GP) under the high-SINR assumption, 
which yields a suboptimal solution to l|2ll when the 
assumption does not hold (e.g., this is the case in this ex- 
ample above). The ADP algorithm [TJ can guarantee only 
local optimality^ in a distributed manner. The MAPEL 
algorithm [11 1 can achieve the globally optimal solutions 
but it is centralized with high computational complexity. 
Compared with these algorithms, the SEER algorithm 
[;12J can guarantee global optimality in a distributed 
manner but message passing needed in each iteration 
can be demanding, i.e., each link needs the knowledge 
of the channel gains, the receiver SINK and the signal 
power of all the other links. It is worth noting that the 
performance of SEER hinges on the control parameter 
that can be challenging to choose on the fly. 



2.3 From Network Utility IVIaximization to IVIinimum 
Weighted Utility Maximization 

In order to devise low-complexity distributed algorithms 
that can guarantee global optimality, we first study the 
basic properties for the solutions to and then convert 
||2) into a more structured max-min problem. 

Lemma 2.1: The optimal solution to ^ is on the 
boundary of the feasible utility region T. 

4. The local optimal solution found by ADP happens to be globally 
optimal only in one of the cases that are illustrated in Table |2] 
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Proof: Let U* denote a globally optimal solution 
to l|2ll over and 7* denote the corresponding SINR 
that supports U*. Since f/;( ) is continuous and strictly 
increasing, proving that U* is on the boundary of J- 
is equivalent to showing that 7* is on the boundary of 
the feasible SINR region. Suppose that 7* is not on the 
boundary of the feasible SINR region, which indicates 
that there exists some point 7 such that 7/ > 7^* for all 
I E C and ji > 7^* for some I. Since [/;(•) for any Z e £ is 
strictly increasing in 7;, we have Ui{^i) > Ui{'^l) for all 
I G C and Ui{ji) > Ui{ji) for some I, which contradicts 
the fact 7* is a globally optimal solution. Hence, Lemma 
O follows. □ 
Based on Lemma IZTl if we can characterize the bound- 
ary of T, then it is possible to solve lO efficiently. Thus 
motivated, we first establish, by introducing a "contri- 
bution weight" for each user, the equivalence between 
^ and the minimum weighted utility maximization 
problem. 

Lemma 2.2: Problem ||2) is equivalent to the following 
minimum weighted utility maximization: 



maximize min;g£ ^ 
subject to 0<pi< P/"^'', VZ e £ 
< a;, < 1, V/ e C 

variables {p,x}. 



(3) 



Proof: Let t = J2iec ^i(7j(p)) denote the total utility. 
Since Ui{.) is nonnegative, we define xi G [0, 1] as a ratio 
for the contribution of user I's utility to t. Therefore, 
Ui{ji{p)) = txi and Y^iec^i = 1- Then ^ can be 
rewritten as 



maximize t 

subject to t = ^^iMPli , e £ 

Q<xi<iyiEC 
variables {p,x,t}. 



(4) 



Then, in the context of maximizing t, it suffices to relax 



in lllll as t < 



, VI e C, which is 



equivalent to t < min;g£ £l(2l(e12_ Therefore, iSll can be 
treated as the hypograph form of lO, i.e., lHJl and ^ are 
equivalent |18|, thereby concluding the proof. □ 
For given x, (|3) is quasi-convex^. By introducing an 
auxiliary variable t, we obtain the following equivalent 
formulation: 



maximize t 

subject to U^^{txi){ni + J^k^i ^kiPk) < hupi 

0<Pt< P/°^'',VZ 0<t 
variables {p,t}, 



(5) 



which can be solved in polynomial time through bi- 
nary search on t fl8l . By transforming (|2j to this more 



5. By definition, a function / : R' 
dom/ and all its sublevel sets Sc ■ 
are convex 1181. 



— )■ R is quasi-convex, if its domain 
{x G dom/|/(a;) < c}, for c G R, 



Feasible region 




Fig. 3: An illustration of the max-min problem for the 
case with two links. 



structured max-min problem (|3]l, we are able to find 
each boundary point efficiently. Then, the problem is 
reduced to finding a globally optimal x*, given which we 
can obtain a globally optimal solution, i.e., the tangent 
point of the hyperplane and J-, as illustrated in Fig. 
m Intuitively speaking, x represents a search direction. 
Once we find the best search direction x*, p* can be 
obtained efficiently by searching along the direction of 



2.4 Algorithms for Global Optimization 

In this section, we study algorithms achieving global 
optimality for First, we propose a centralized 

algorithm for which will serve as a benchmark 
for performance comparison. Then, by using EDT and 
SA, we propose a distributed algorithm, DSPC, for the 
problem ||3|. Building on this, we propose an enhanced 
algorithm EDSPC to improve the convergence rate of 
DSPC. 

1) A Centralized Algorithm 

Based on Lemma 12.11 and Lemma 12.21 we develop 
a centralized algorithm (Algorithm [U to solve the 
max-min optimization problem (|3]l under considera- 
tion. Roughly speaking, by dividing the simplex S = 
{^\J2iec^i = 1,0 < 2;/ < 1,VZ G £} into many small 
simplices, the algorithm can find the optimal point on 
the boundary of J-. 

Proposition 2.1: Algorithm [T] converges monotoni- 
cally to a globally optimal solution to lO as the approx- 
imation factor 6 approaches zero. 

Proof: For given e. Algorithm [T] divides the simplex 
^ = {x|E;e£^i = 1,0 < a;; < 1,VZ e £} into 
[1/e] simplices^. Then Algorithm [T] computes the power 
allocation p* by solving lO at x given by the center 
point of the simplex. Since the optimal search direction 
x* is in S, when the approximation factor e approaches 

6. fl/el denotes the smallest integer greater than 1/e. 
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zero. Algorithm [T] exhaustively searches every point in 
the simplex S. Therefore, Algorithm [1] can converge 
monotonically to a globally optimal solution to (|3). □ 
Remarks: In Algorithm [TJ by controlling e, one can 
obtain a solution arbitrarily close to a globally optimal 
one. Accordingly, Algorithm [T] can guarantee global 
optimality. However, the complexity of this algorithm 
can be high and this is possible only with centralized 
implementation. Algorithm 1 will be used only as a 
benchmark for performance evaluation of distributed 
algorithms. 



Algorithm 1 

Initialization: Choose the approximation factor e > 0, 
and construct the initial simplex S with the vertex set 
V = {vi, ...,vl}, where vi ~ e; and e; is the Ith unit 
coordinate vector. Let Vc = x S/ez: the center of 
S. Compute p* by solving © at the point x = Uc. 
Repeat 

1) Each simplex is divided into L subsimplices 
Si, ...,Sl. Let S* be a simplex with the vertex 
set V = {vi,...,vl}, and let v e S\V. Choose 
V = X Sie£ Then, each simplex Si is defined 
as having vertex set V\vi [J v. 

2) For each new simplex, compute p* by solving l(5]l 
at X given by the center point of the simplex. 

3) Find the current best solution to l|3|. 

Until The number of simplices is greater than 1 /e. 



2) DSPC Algorithm 

Next, we devise a distributed stochastic power control 
(DSPC) algorithm based on EDT 114J and SA HSJ. To this 
end, we first introduce auxiliary variables and use EDT 
to transform (|3]l with the auxiliary variables into an un- 
constrained problem. Then, we solve the unconstrained 
problem by using the SA mechanism. Specifically, define 
ti = IM|M) and rewrite ^ as 

minimize — min;££ ti 

subject to tixi < Ui{-fi{p)),yi e C 

<pi < Pl^'^'^yi e C ^ ' 

0<ti,0<xi < l,\fl e C 
variables {p,x, t}. 

Next, we use EDT to write Lagrangian for l|6ll as 

L(p,x,t, a,/3) = -mmifzcU + a\J2i(zcXi - l\ 

where = niax(0,?/), and a e M and /3 e 

are the penalty multipliers for penalizing the constraint 
violations. Based on EDT ||T41 , there exist finite a* > 
and /3,* > for alH e £ such that, for any a > a* 
and (3i > Pi , \f I e C, the solution to is the same as 
(|6). Note that lO does not include the power constraints, 
due to the fact that there is no coupling among the user 
powers. Therefore, the minimization of l|7ll with respect 



to the primal variables (p, x, and t) can be carried out 
individually by each user in a distributed fashion. 

The next key step is to perform a stochastic local 
search by each user based on SA. Let t;, xi and pi 
denote the primal values of the Ith user, and and x'l 
denote the new values randomly chosen by the Ith user. 
Accordingly, ijxj can be treated as a new target utility 
for the ^th user. To achieve this target utility, the Ith user 
updates p'l by 

p; = min(^^^^p,,P/»-), (8) 

where ji is the current SINR measured at the Ith user's 
receiver. Note that (O does not need any information 
of channel gains except the SINR feedback, i.e., ji. 
Since (|8) corresponds to the distributed power control 
algorithm of standard form as described in it 
converges geometrically fast to the target utility. Thus, 
we assume that each user I updates pi in a faster time- 
scale than ti and xi such that pi always converges 
before the next update of ti and xi. Let A denote 
the difference between L{pi,xi,ti\p-i,x-i,t-i,a, (3) and 
L{p'i,x'i,t[\p'_i,x_i,t_i,a,(3), where y_; is the vector y 
without the Ith user's variable. If A > 0, i.e., t[, a-J 
and p[ reduce Lagrangian (0, then they are accepted 
with probability 1; otherwise, they are accepted with 
probability cxp [y)' where T is a control parameter (it 
is also called temperature). Note that, as T decreases, 
the acceptance of uphill move becomes less and less 
probable, and therefore a fine-grained search takes place. 
It has been shown that, as T tends to according to a 
logarithmic cooling schedule, SA converges to a globally 
optimal point IT3ll , l20ll . To compute A locally by each 
user I, we assume that user / broadcasts the terms t;, xi 
and Pi{tixi — Ui{-fi{p)))^ , whenever any of these terms 
changes. 

Besides updating the primal variables, each user / also 
needs to update a and /?; to satisfy a > a* and /3i > (3^. 
Here, we apply the method given by Illl to update a 
and /3/. In particular, if any constraint is violated, a and 
/3i are updated as follows: 

a ^ a + a\J2iec^i - A' (q\ 
Pi ^ Pi + giitixi - Ui{ji))+ , "^l e C, 

where a and gi are used to control the rate of updating a 
and Pi . Thus, after initialization, a and Pi increase in pro- 
portion to the violation of the corresponding constraint, 
which may lead to excessively large penalty values. Since 
it is beneficial to periodically scale down the penalty 
values to ease the unconstrained optimization, a and pi 
are scaled down by multiplying with a random value (it 
is chosen empirically between 0.7 to 0.95)* if the penalty 
decrease condition is satisfied, i.e., the maximum violation 

7. A power control algorithm is of standard form, if the interfer- 
ence function (the effective interference each link must overcome) is 
positive, monotonic and scalable in power allocation |191. 

8. See il4l for the detailed description on the choice of these param- 
eters. 
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of constraints is not decreased after running Step 1 in 
Algorithm |2] several times consecutively, e.g., five time^. 
A detailed description of DSPC algorithm is given in 
Algorithm E] 

Proposition 2.2: The distributed stochastic power 
control algorithm (Algorithm |2]l converges monotoni- 
cally to a globally optimal solution to l|3]l, as temperature 
T in SA decreases to zero. 

Proof: For a given pair of a and f3, Algorithm |2] 
converges to a globally optimal solution to lO by using 
the logarithmic cooling schedule flSlI , l20ll . If the solution 
satisfies the constraints of it is also a globally optimal 
solution to lHJl based on EDT, i.e., current a and /3 
satisfy a > a* and Pi > /?f for all I e C (14). By 
iteratively updating a and /3, Algorithmic will converge 
to a globally optimal solution to l|3]l, when a and (3 satisfy 
a> a* and /3/ > /?f for all I e C. □ 

Remarks: The DSPC algorithm can guarantee global 
optimality in a distributed manner without the need of 
channel information. In particular, it can adapt to chan- 
nel variations by utilizing the SINR feedback. However, 
the convergence rate of DSPC is slow due to the use of 
logarithmic cooling schedule. 



Algorithm 2 Distributed Stochastic Power Control 
(DSPC) 

Initialization: Choose e > 0. Let a ~ 0, /J; = 0, V/ e C, 
and randomly choose p, x and t. 
Step 1: update primal variables 
Set T = To, and select a sequence of time epochs 
{ti,T2,...} in continuous time. 
Repeat for each user / 

1) Randomly pick and x'^ in the feasible region, 
and update pj according to l|8]l. 

2) Keep sensing the change of I3i{tixi — C^/(7/(p)))^ 
broadcast by other users. 

3) Compute A, and accept ij, xj, and with prob- 
ability 1, if A > 0, or with probability exp(^), 
otherwise. 

4) Broadcast t[ and .tJ, if and x[ are updated. 

5) For each time epoch r^, update T = To/log{i + 1). 
Until T < e. 

Step 2: update penalty variables 
For each user I, 

1) Update a and /?/ according to lO, and scale down 
a and if the penalty decrease condition is 
satisfied. 

2) Goto Step 1 until no constraint is violated. 



3) Enhanced DSPC Algorithm 

It can be seen from Algorithm |2] that it is critical to 
find the optimal penalty variables a and f3 for computing 
|[7]l. Moreover, a logarithmic cooling schedule is used to 
ensure convergence to a global optimum. To improve 
the convergence rate, we propose next an enhanced algo- 
rithm for DSPC (EDSPC) by empirically choosing the ini- 



tial penalty values ao and /3g and employing a geometric 
cooling schedule IflSlI , which reduces the temperature T in 
SA by T = ^T, < ^ < 1, at each time epoch. Compared 
with the logarithmic cooling schedule, T converges to 
much faster under the geometric cooling schedule, which 
in turn improves the convergence rate of DSPC. The 
resulting solution is given in Algorithm |3l 

We note that although EDSPC converges much faster 
than DSPC, it may yield only near-optimal solutions. 
Based on EDT, we choose ao > a* and /?o; > /^f, y I E C, 
to satisfy the optimality conditions for penalty variables. 
Obviously, by choosing large ao and f3oi, these conditions 
can be always satisfied. Nevertheless, very large penal- 
ties introduce heavy costs for constraint violations such 
that EDSPC may end up with a feasible but suboptimal 
solution. Therefore, the selection of initial penalty values 
plays a critical role in the performance of EDSPC and 
deserves more attention in future work. 



Algorithm 3 Enhanced Distributed Stochastic Power 
Control (EDSPC) 

Initialization: Choose e > 0. Let a = ao, Pi = (3oi, 

\/l G C, and randomly choose p, x and t. 

Set T = To, and select a sequence of time epochs 

{ti, r2, ...} in continuous time. 

Repeat for each user I 

1) Randomly pick t[ and in the feasible region, 
and update p\ according to (|8). 

2) Keep sensing the change of f3i{tixi — Ui{ji{p)))'^ 
broadcast by other users. 

3) Compute A, and accept tj, .tJ, and p'^ with prob- 
ability 1, if A > 0, or with probability exp(-^), 
otherwise. 

4) Broadcast t[ and xj, if t'l and x[ are updated. 

5) For each time epoch Ti, update T = ^T. 
Until T <e. 



2.5 Numerical Examples 

In this section, we evaluate the utility and convergence 
performance of Algorithms |2] and |3](DSPC'' and EDSPC). 
We consider a wireless network with six links randomly 
distributed on a lOm-by-lOm square area. The channel 
gains hik are equal to d^^, where dik represents the dis- 
tance between the transmitter of user I and the receiver 
of user k. We assume Ui{ji{p)) = log(l+7;(p)), P/"'''' = 1 
and ?i; = lO^'' for all I S C, and consider one randomly 
generated realization of channel gains given by 

" 0.3318 0.0049 0.0141 0.0021 0.0016 0.0007 " 

0.0031 0.9554 0.0063 0.0140 0.0012 0.0025 

_ 0.0155 0.0042 0.6166 0.0046 0.0108 0.0018 

0.0017 0.2188 0.0340 0.6754 0.0062 0.0215 ' 

0.0020 0.0017 0.2216 0.0042 0.2955 0.0028 

_ 0.0007 0.0079 0.0254 0.2553 0.0404 0.3025 _ 

9. The geometric cooling schedule is employed to accelerate the 
convergence rate of DSPC in the simulation. DSPC updates penalty 
values until they satisfy the threshold-based optimality condition. 



7 



15 




200 400 600 800 1000 

Iteration 

Fig. 4: Convergence performance of DSPC, EDSPC, SEER 
and ADR 



20 









1 




— X 













DSPC EDSPC SEER ADP 



Fig. 5: Comparison of the average utility performance 
(with confidence interval) of DSPC, EDSPC, SEER and 
ADP 



Fig. |4] shows how the total utility in the EDSPC 
algorithm converges over time, where we choose all 
the initial penalty values equal to 10. Also, we choose 
^ = 0.9, p = 1 and g = I, and use Algorithm [l] 
as a benchmark to evaluate the optimal performance. 
As shown in Fig. IH the EDSPC algorithm approaches 
the optimal utility, when the initial penalty values are 
carefully chosen. Moreover, the convergence rate of the 
EDSPC algorithm is much faster than DSPC, since DSPC 
continues updating the penalty values after the optimal 
solution is found for the current penalty values. Fig. 
|5] illustrates the average performance (with confidence 
interval) of DSPC, EDSPC, and SEER under 100 random 
initializations, with the same system parameters as in 
Fig. H As shown in Fig. 13 both DSPC and EDSPC are 
robust against the initial value variations. 

A comparison with the SEER and ADP is also depicted 
in Fig.|4]and|5l As mentioned in Section [ij ADP can only 
guarantee local optimality. Therefore, for nonconvex 
problems (e.g., in this example), ADP may converge to a 
suboptimal solution. As noted in |[T2). the performance 
of SEER heavily hinges on the control parameter that can 
be challenging to choose on the fly. In contrast, DSPC can 



approach the globally optimal solution regardless of the 
initial parameter selection, but the convergence rate may 
be slower. Further, EDSPC improves the convergence 
rate, but the initial penalty values would impact how 
close it can approach the optimal point. From the point 
of view of reducing the amount of message passing, in 
our algorithms each link does not need any knowledge 
of the channel gains (including its own channel gain), the 
receiver SINR of the other links and the signal power of 
the other links, which are all used in the SEER algorithm. 

3 Joint Scheduling and Power Control 
FOR Stability of Queueing Systems 

In Section|2l we studied the distributed power allocation, 
by using DSPC and EDSPC, for utility maximization 
in the saturated case. In this section, we generalize the 
study by considering a queueing system with dynamic 
packet arrivals and departures. Specifically, we develop 
a joint scheduling and power allocation policy to sta- 
bilize packet queues by integrating our power control 
algorithms with the celebrated back-pressure algorithm 

ED. 

3.1 Stability Region and Throughput Optimal Power 
Allocation Policy 

Consider the same wireless network model with L links 
as in Section |2l We assume that there are S classes of 
users in the system, and that the traffic brought by users 
of class s follows {Asi{t)}^-j^, which are i.i.d. sequences 
of random variables for all / = 1, ...,L and s ~ 1, S, 
where Agi (t) denotes the amount of traffic generated by 
users of class s that enters the link I in slot t. Let Q-^^i^{t) 
and (9^(;) (t) denote the current backlog in the queue of 
class s in slot t on the transmitter and receiver sides of 
link I, respectively. The queue length Qy(;-)(t) evolves 
over time as 

QJ(,)(i+l) - max(QJ(,)(t)-rf(<),0) + ^,(t) 

+ ^{m\T(l)=R{m).meC} '^mlOi 

(10) 

where rj'(<) denotes the transmission rate of link I for 
users of class s. The third term in i fTOll denotes the traffic 
from the other links. Assuming that the second moments 
of the arrival process {Agiit)}'^^ are finite, the queue 
length process {(3^(;)(t)}J^i forms a Markov chain. 

Let Esi = 1 be the indicator that the path of users of 
class s uses link I, and E^i = 0, otherwise. As is standard 
mi, 1221, 1^, the stability region is defined as follows. 

Definition 3.1: The stability region A is the closure 
of the set of all {V'sjf^i for which there exists some 
feasible power allocation policy imder which the sys- 
tem is stable, i.e., A = UpeP^(P)' where A(p) = 
{{V'Jf=il Ef=i Esi^^s < ri{p)yi}, and P denotes the set 
of feasible power allocation. Here tps denotes the first 
moment of {Asi{t)}'^^, i.e., the load brought by users of 
class s, and ri (p) denotes the rate of link I imder power 
allocation p. 
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For the sake of comparison, the throughput region^° T 
of the corresponding saturated case is defined as the set 
of all feasible link rates, i.e., T = {r|r; = ^((p), p G P}. In 
general, the throughput region T may be different from 
the stability region A, except for some special cases (e.g., 
in slotted ALOHA systems the throughput region and 
the stability region are the same II28I for two links and 
in a multiple-access channel the information theoretic 
capacity region is equivalent to its stability region |29| ). 

The system is stable if the arrival rates of packet 
queues are less than the service rates such that the 
queue lengths do not grow to infinity. In order to sta- 
bilize packet queues, it is critical to find the optimal 
scheduling and power allocation policy that maximizes 
the weighted sum rate given by i fTTI l. By integrating 
our power control algorithms and the back-pressure 
algorithm, we propose the following joint scheduling 
and power allocation policy (presented in Algorithmic 
to stabilize the queueing system. 

Proposition 3.1: The joint scheduling and power al- 
location policy (Algorithm |4]| can stabilize the system 
when the load {V's}f=i is strictly interior to the sta- 
bility region A, i.e., there exists some e > such that 
{i's + e}f=i e A. 

The proof is similar to that in 12311 , Il24l , and is omitted 
for brevity. 

Note that Algorithm |4] can be viewed as a single-hop 
dynamic back-pressure and resource allocation policy 
124), crafted towards solving the weighted sum rate max- 
imization problem l(TT|l . Specifically, by using the DSPC 
algorithm. Algorithm |4] can be implemented distribu- 
tively to find the globally optimal resource allocation. We 
should caution that EDSPC can be applied to improve 
the convergence rate of Stage 2 in Algorithm |4]but it may 
render a suboptimal schedule (i.e., it can not stabilize all 
possible {V's}f=i within A), due to the fact that EDSPC 
may not always find the global optimal power allocation. 

To reduce the complexity, we can consider a policy that 
computes ifTTI l every few slots, and it can be shown that 
this policy can also stabilize the system, when {V'sjf^i 
is strictly interior to the stability region A |26| , |27|. 

3.2 Numerical Examples 

In this section, we present numerical results to illustrate 
the use of Algorithm |4] for stabilizing a queueing sys- 
tem. We consider a one-hop network (i.e., E = {Esi} 
is the identity matrix) with two users (classes), where 
the channel gains are given by hn = 0.3, hi2 = 0.5, 
/i2i — 0.03, and /122 ~ 0.8, and the noise power is 0.1 for 
each link. The maximum transmission power is set to 1 
and 2 for links 1 and 2, respectively. Besides, we assume 
that the users of class s arrive at the network according 
to a Poisson process with rate As, and that the size of file 
brought by each user follows an exponential distribution 

10. Note that the feasible utility region T defined in Section |2] is the 
throughput region, when the utility function is the same as the rate 
function. 



Algorithm 4 Joint Scheduling and Power Allocation 
Policy 

Stage 1: For each link I, select a link weight according 
to wi {t) = max Df{t), where the difference of queue 

s — l,...,S 

lengths of class s is Df{t) = max((5|.(;)(<) — (5^(;)(t),0), 
if the receiver of link I is not the destination of class 
s's traffic, and Di{t) = Qt(i)' otherwise. 
Stage 2: Compute the optimal power allocation p* 
in each slot t by solving the following problem with 
DSPC algorithm 

L 

p* = argmax J2 wi(t)ri{p). (11) 
P 1=1 

Thus, the transmission rate of link I in slot t is given 
by n(p*) =log(l + 7,(p*)). 

Stage 3: Let S;* = arg max Dsi{t) denote the class 

S—1.....S 

scheduled in slot t; if multiple classes satisfy this 
condition, then is randomly chosen as one of these 
classes. Then, schedule these classes according to the 
solution given by Stage 2. 



with mean Vg . The load brought by users of class s is then 
-0s = XgVs- For this example, we also study the stability 
region A and compare it with the throughput region T 
of the corresponding saturated case as illustrated in Fig. 
m The stability region follows from the union of link 
rates that are conditioned on whether the other link is 
backlogged or not l28l , l29l . First, we derive the stability 
region for the given power allocation. Then, we vary 
power allocation in the feasible region, and by taking the 
envelope of these regions, we obtain the overall stability 
region shown in Fig. |6l However, different from the 
previous cases, where the throughput region is the same 
as the stability region, e.g., in a slotted ALOHA system 
with two links [28] and in a multiple-access channel 
l29l , in our case under the SINK model, the throughput 
region F is strictly smaller than the stability region (due 
to the underlying nonconvex optimization problem), as 
observed from Fig. |6l which is the convex hull of F, 
i.e., Co{T), achievable by timesharing across different 
transmission modes^^. 

Then, we vary the arrival rate A and the average file 
size v to change the traffic intensity ip = Assuming 
that the arrival rate and the average file size of each 
user are the same, we compare the sample paths of each 
user's queue length for = 1 (^ = 1/ = 1) with 
= 1.5 (A = 1.5, V = 1) in Fig. When = 1, 
which falls in the stability region shown in Fig. |6l the 
system is stabilized by using Algorithm 31 while, when 
ip = 1.5, which is outside the stability region, the system 
becomes unstable. Fig. |8l illustrates the average delay of 
the system as a function of the arrival rates. The delay 
is finite for small loads and grows unbounded when the 

11. The transmission mode is defined as the transmission rate pair 
within the throughput region JF. 
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Fig. 7: Comparison of sample paths of a user's queue 
length for different traffic loads. 



loads are outside the stability region. 

4 Power control for Multicast Commu- 
nications 

Due to wireless multicast advantage fl6l , multicasting 
enables efficient data delivery to multiple recipients 
with a single transmission. In this section, we extend 
the distributed stochastic power control algorithms in 
Section |2] to support multicast communications. 

4.1 System Model 

Beyond the model described in Section |2l we consider 
that each user / has one transmitter and a set A4i of 
receivers. The corresponding transmission rate, r/, is de- 
termined by the bottleneck link among these transmitter- 
receiver pairs, i.e., r; = minm^M, ^im, where rim denotes 
the link rate between the transmitter of user I and its 
receiver m, and it is calculated based on the Shannon 



rate log(l + 7;m(p)) for Gaussian, flat fading channels. 
Here, we do not consider the general broadcast capacity 
region but rather focus on maximizing the bottleneck 
link rates. 

4.2 Network Utility Maximization 

We seek to find the optimal power allocation p* that 
maximizes the overall system utility subject to the power 
constraints in multicast commimications, as follows: 

maximize J2iec^ii^0 

subject to n lammeMi n^n,^ I e C 

rim. = log(l +7;m(p)),V I e C,m e Ml 

0<Pi< P^'"'.^^ e jC 
variables {p, {n}, 

(12) 

Similar to l|2ll, (1211 is nonconvex due to the complicated 
interference coupling between individual links. In order 
to devise distributed algorithms to solve |[T2t , it suffices 
to relax r; = minmeMi rim in lO as < log(l + 
7/>ri(p))iV I e C,m e Ml. Thus, lO can be rewritten 
as 

maximize 

subject to n < log(l + 7zm(p)), V I e jC,m e Mi 

0<Pi< Pr'^'^yi e £ 
variables {p,r}. 

(13) 

4.3 Distributed Global Optimization Algorithms 

We develop next distributed algorithms that can find the 
globally optimal solutions to |(T3] | based on EDT and SA. 
To this end, we first rewrite the optimization problem 
lHUl as 



minimize - E/ec (''i ) 
subject to ri < log(l + 7;m(p)), V I € jC,m e Mt 

o<Pi< Pr'''',\/i e C 

variables {p, i"}- 

( 



(14) 
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Next, we use EDT to write Lagrangian for i fMl l as 

where € M are the penalty multipliers. Based on 
EDT, there exist finite > for alU G £, m G A^i such 
that, for any aim > ctim' V ? G £, m G Mi, the solution to 
([15) is the same as ((T4] l fT4j. Since there is no coupling 
among the power constraints of the individual users, (|T5] | 
does not include the power constraints. Thus, each user 
satisfies its own power constraint while minimizing dlSt 
in a distributed operation. 

As in Section 121 the key step is to let each user perform 
a local stochastic search based on SA. Let r; and pi denote 
the primal values of the Zth user, and denote the 
new values randomly chosen by the ?th user, which is 
treated as a new target transmission rate for the Zth user. 
Different from the unicast commimications case, the Zth 
user updates pj by 

pi='^"^( n.in:;'^;„„ p^-pr^). (i6) 

where ^im is the current SINR measured at the receiver 
m of user I. Note that |[T6l l does not need any information 
of the channel gains except the SINR feedback from the 
intended receivers, i.e., 7;,„. Since llT6l l is in standard 
form as described in [19], it converges geometrically 
fast to the target utility. The steps to update ri and 
aim are similar to DSPC Algorithm [2] in Section [2 A 
detailed description of DSPC algorithm for multicast 
communications is presented in Algorithm [5] 

Proposition 4.1: The distributed stochastic power 
control algorithm for multicast communications (Algo- 
rithm [5|l converges to a globally optimal solution to (l3)l , 
as temperature T in SA approaches zero. 

Proof: The proof is based on EDT and SA, and 
follows similar steps used in the proof of Proposition 
12.21 and it is omitted here for brevity. □ 

Likewise, to improve the convergence rate, we also 
propose an enhanced algorithm for Algorithm [5] by 
empirically choosing the initial penalty values and em- 
ploying a geometric cooling schedule. The resulting al- 
gorithm is given in Algorithm [6] Similar to the unicast 
case. Algorithms [5] and [6] do not need any knowledge 
of channel information (or the bottleneck link) and they 
are dynamically updated by the SINR feedback from the 
intended receivers. 

4.4 Numerical Examples 

In this section, we evaluate the performance of Algo- 
rithms [5] and [6] for multicast communications. We con- 
sider a wireless network with four transmitters and each 
transmitter has two receivers. These transmitters and 
receivers are randomly placed in a lOm-by-lOm square 
area. The channel gains him are equal to where 
dim represents the distance between the transmitter I 
and the receiver m. The channel gains him are equal 
to d^^, where dim represents the distance between the 



Algorithm 5 DSPC for Multicast Communications 



Algorithm 6 EDSPC for Multicast Communications 

Initialization: Choose e > 0. Let aim = ci^m,,, V I G 

C,m & Ml and randomly choose r and p. 

Set T = To, and select a sequence of time epochs 

{ti, T2, ...} in continuous time. 

Repeat for each user / 

1) Randomly pick r^, and update p'l according to 

CS). 

2) Keep sensing the change of X/meAii ~ 
log(l + 7(to(p)))^ broadcast by other users. 

3) Let A be the difference between 
L{p,ri\r^i,{aim}) and L(p', rj|r_(, {«;,„}), 
and accept r'^ and with probability 1, if A > 0, 
or with probability exp(-^), otherwise. 

4) Broadcast Ui{r[), if r[ is accepted. 

5) For each time epoch Ti, update T = ^T. 
Until T <e. 



transmitter I and the receiver m. We assume Ui{ri) = ri, 
pmax ^ ^^^^^ ^ ;^Q-4 fQj. alU G £ and m G 

Ml- Fig. [9] illustrates the fast convergence performance 
of Algorithms [5] and [6] in multicast communications.^^ 
Besides, we examine the average performance (with 
confidence interval) of DSPC and EDSPC for multicast 
communications under 100 random initializations with 
the same system parameters as in Fig. [9] As illustrated 

12. The other existing algorithms have been specifically designed for 
unicast communications; therefore, they are excluded here from the 
performance comparison. 



Initialization: Choose e > 0. Let aim = 0,^ I e C,m e 
A4i and randomly choose r and p. 
Step 1: update primal variables 
Set T = To, and select a sequence of time epochs 
{ti, T2, ...} in continuous time. 
Repeat for each user I 

1) Randomly pick r^, and update p\ according to 

m- 

2) Keep sensing the change of EmeA4, "'"'(^' ~ 
log(l + 7;m(p)))^ broadcast by other users. 

3) Let A be the difference between 
T(p, n|r_;,{ai,„}) and T(p', rj|r_;, {a;™}), 
and accept and with probability 1, if A > 0, 
or with probability exp(^), otherwise. 

4) Broadcast Ui{r\), if r'l is accepted. 

5) For each time epoch x;, update T = To/ log(i + 1). 

Until T < e. 
Step 2: update penalty variables 
For each user I, 

1) Update aim ^ aim + Qim{n - log(l + 7im(p)))"^, 
and scale down aim, if the condition of penalty 
decrease is satisfied. 

2) Goto Step 1 until no constraint is violated. 
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in Fig. [lOl both algorithms |5] and |6] are robust against the 
initial value variations. 

5 Conclusion 

We studied the distributed power control problem of op- 
timizing the system utility as a fimction of the achievable 
rates in wireless ad hoc networks. Based on the observa- 
tion that the global optimum lies on the boundary of the 
feasible region for unicast communications, we focused 
on the equivalent but more structured problem in the 
form of maximizing the minimum weighted utility. Ap- 
pealing to extended duality theory, we decomposed the 
minimum weighted utility maximization problem into 
subproblems by using penalty multipliers for constraint 
violations. We then proposed a distributed stochastic 
power control (DSPC) algorithm to seek a globally opti- 
mal solution, where each user stochastically announces 
its target utility to improve the total system utility via 
simulated annealing. In spite of the nonconvexity of the 



underlying problem, the DSPC algorithm can guarantee 
global optimality but only with a slow convergence 
rate. Therefore, we proposed an enhanced distributed 
algorithm (EDSPC) to improve the convergence rate 
with geometric cooling schedule in simulated annealing. 
We then compared DSPC and EDSPC with the existing 
power control algorithms and verified the optimality and 
complexity reduction. 

Next, we proposed the joint scheduling and power 
allocation policy for queueing systems by integrating 
our distributed power control algorithms with the back- 
pressure algorithm. The stability region was evaluated, 
which is shown to be strictly greater than the through- 
put region in the corresponding saturated case. Beyond 
unicast commimications, we generalized our power con- 
trol algorithms to multicast communications by jointly 
maximizing the minimum rates on bottleneck links in 
different multicast groups. Our distributed stochastic 
power control approach guarantees global optimality 
without the need of channel information, while reducing 
the computation complexity, in general systems with 
unicast and multicast communications, and applies to 
both backlogged and random traffic patterns. 
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